
\ 



SEQUENCE LISTING 



<110> Consejo Superior de Investigaciones Cientificas 
<120> New Gene 
<130> DGB/DE/PCT131 

<140> PCT/GB99/03019 
<141> 1999-09-10 

<150> SE 9803069.5 
<151> 1998-09-10 

<150> US 60/100,873 
<151> 1998-09-17 

<160> 4 

<170> Patentln Ver. 2.1 

<210> 1 

<211> 2610 

<212> DNA 

<213> Homo sapiens 

<220> , . 

<223> This gene is referred to in the application as 

Human Death Inducer Obliterator Gene 1. It has now 

been named by the Human Gene Nomenclature 

Committee as Death-Associated Transcription 

Factor (DATF-1) 

rtcgqtggcc gtccgcccac tccgcggcgt tcggggaaat ggctgcgaga ccctagaggc 60 
ctgcgglgct ?actccacgg gaacagcctc tagataatct gagttgttga aaatacgaag 20 
cc?gttactc gtgaacagtg gctgacaaca gtgttgttgt gagcctggct gtctgcttgg 180 
acccagaggt ttcgtctgcc agggtttttg gttgtattta ggatttcagg gaaaagtgtc 240 
caagctttca gtgttggagc aggtatggac gacaaaggcg acccgagcaa tgaggaggca 300 
cctaaggcca tcaaacccac cagcaaagag ttcaggaaaa catggggttt tcgaaggacc 360 
actatcgcca agcgagaggg cgcaggggac gcggaggctg acccactgga sccgccaccc 420 
ccacagcagc agctgggcct gtccctgcgg cgcagtggga ggcagcccaa ^gcactgag 480 
cgcgtggagc agttcctgac cattgcgcgg cgccgcggca ggaggagcat gcctgtctcc 540 
ctggaggatt ctggtgagcc cacgtcctgc cccgccacag acgccgagac agcctccgag 600 
ggHgcgtgg aaagcgcttc tgagaccaga agcggccccc agtctgcttc cacagctgtg 660 
aaggaacgac cagcctcttc tgaaaaggtg aaaggagggg atgaccacga tgacacctcc 720 
gatagtgaca gcgatggcct gaccttgaaa gagcttcaga atcgccttcg "ggaagcgg 780 
gaacaggagc ccactgagag gcccctgaaa gggatccaga gtcgcctgcg gaagaagcgc 840 
cgggaggagg gtcccgccga gactgtgggc tccgaggcca gtgacactgt ggagggcgtc 900 
c?gcccag?a agcaggagcc cgagaacgat cagggggttg tgtcccaggc tgggaaagat 60 
gacagaglga gtaagttgga gggaaaggcg gctcaggaca tcaaagatga ^agcctgga 1020 
gacttgggcc gaccgaagcc tgaatgtgag ggttacgacc ccaacgccct gtattgcatt 1080 
tgccgccagc ctcacaacaa caggtttatg atttgctgtg accgctgtga agaatggttt 1140 
catggcgatt gtgtgggcat ttctgaggct cgagggaggc ttttggaaag gaatggggaa 1200 
gac?atatct gclcaaactg caccattctg caagtgcagg atgagactca ttcagaaacg 1260 
gcagatcagc aggaagctaa atggagacct ggagatgctg atggcaccga ttgtacaagt 1320 



ataggaacaa 
gctgcaaatc 
ggtgcctcaa 
tgcagtaatg 
aaagaacaga 
ccgaaatgcg 
gaaaaaaaag 
gggaaggaag 
gcagtaaagc 
cacctagggg 
tgtccaggac 
cagcgggaag 
tgtcaggact 
taatgattga 
cttcatccca 
cacatgagtt 
ccagcacatg 
catctgattt 
tccatgtccg 
actcctgctt 
aggacgcgtg 
gaatgtttag 



tagagcagaa 
caagtggcaa 
aatgtattgg 
actgtatcct 
agccaaagcc 
gtgctcaggc 
agaccacagt 
cagcttgtga 
cagaaaagac 
ttggcctcct 
tgggagttgc 
cagagcccat 
tcctgtcact 
ttctttaatc 
gtctgattgc 
gttctgacaa 
aaagcatcac 
tcaggtgtac 
attcacacgc 
cataggcccc 
ccttggactt 
actcttaaga 



gtctagcgaa 
gaagaaactc 
ccccgggtgc 
caaacacgcc 
taaagaaaag 
aggtattaaa 
gaagaaggca 
gagcagcacg 
tgctgctccc 
ggacccctcc 
agctttgtgt 
gtccaggatg 
gttttccaaa 
tctgttttca 
atagccacac 
cagcgctgta 
ttctttttta 
atttattttt 
tacacaccca 
acggtaagtg 
ggtattctac 
tcatatcctg 



gaccaaggga 
aagatcttcc 
tgtcacgtgg 
gcagcgacaa 
atgaagatga 
atctcttctg 
gtggtggtcc 
ccgtcgtggg 
tcgccgtcac 
cgttctttct 
taagctgatc 
cctcctgctg 
gctgtaaacc 
ctctcaggct 
tgcccggcac 
tacgcttcag 
tgttgtggga 
gactgggcag 
catgaacaca 
agttcacacc 
atgtgactgg 



taaagggtag 
agcctgtgat 
cgcagcccga 
tgaagtttct 
agccagagaa 
tgcacaagag 
ctgcgcggag 
cgagcgatca 
tgttgtataa 
ggatagccat 
acagacaccg 
ccctgtgtcc 
tcactggtga 
ctggtaagta 
gccacatcca 
tttttccaca 
atctttgcaa 
ataggggatt 
ttcgaacttc 
tagaacactg 
ctttcttgcc 



aattgagaaa 
agaggcgcct 
ctcggtgtac 
aagctcaggt 
gcccagtctt 
accagctcca 
tgaagcactc 
caattacaat 
atgtatgtat 
cccctgggcc 
gctgcaccat 
atccctagtc 
acgttcacct 
tttgtattct 
cccctgtctg 
ttgtccacgg 
gttagtgttg 
tttttttttt 
gaaggcacac 
tcctgaccgc 
ctcgtctctt 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2610 



<210> 2 
<211> 2867 
<212> DNA 
<213> Murine 



<400> 2 

tccgtggtag 

cagtccccac 

actgacaaca 

tatgtaggat 

aaagggcacc 

aggaaaacct 

gaggcggacc 

cggcaaccaa 

aaaaagaatg 

gatgtggaga 

gtatctgact 

gaggaagaag 

cgccttcgga 

cgcctgagga 

agtgccgagc 

tcccagtcag 

acagaggaaa 

aatgccctgt 

cggtgtgagg 

ctggaaagga 

gagacaaacg 

ggcacagact 

aagggtagga 

cctgtcgtag 

cagcctgact 

agatttctaa 

ccagaaaagt 



ctctggaaat 
tgtccctttg 
gtggggtgag 
ttccagccaa 
tgagcaatga 
ggggttttcg 
ccagtgagca 
aacgtactga 
tgccggtgtc 
cagcttccga 
ccttagggaa 
acacctctga 
gaaagcgaga 
agaagcgcag 
aggacagacc 
agacagatga 
accccaggga 
actgcatctg 
agtggttcca 
acggggaaga 
gtagcgccac 
gcacaagcat 
ttgagaaggc 
aggctcctgg 
ctgtgtattg 
gttcaggtaa 
tcagtcttcc 



ggctgcggga 
gttgttcgaa 
gcttggccgt 
aggtttccaa 
ggaagcaccc 
aagaaccacg 
gcaaccacag 
gagggtagaa 
cctggaggat 
ggggagcgtt 
agaacatcct 
cagtgacagt 
gcaagaacct 
agaggaagat 
tctctgtaag 
catagaaaat 
agcgggcaaa 
ccgccagcct 
tggtgactgt 
ctacatctgc 
caatgagcag 
agggacagta 
agcaaacccc 
tgctcctaaa 
cagtaatgac 
agaacaaaaa 
aaaatgcagt 



tcccggcggc 
gctccggaat 
ctgcttgacc 
gctttcagtt 
aaggctatca 
attgccaaac 
cagcataacc 
gagtttctta 
tccagtgagc 
gaaagcagtt 
gcctcttctg 
gatggcctta 
gtggagaggt 
tctgccgaaa 
caggagcctg 
cagttggaag 
ccaaagcctg 
cacaacaaca 
gtgggtattt 
ccaaattgca 
gactctgggt 
gagcagaagt 
agcggcaaga 
tgcattggcc 
tgcattctca 
acaaaaccca 
gttcaggtgg 



cggggagctt 
cttctcattg 
tggccccagg 
ttgggacagg 
aacccaccag 
gtgagggtgc 
tctccctgcg 
ccacggttcg 
ccacatcttc 
ctgagatcag 
aaaaggcaaa 
cgttgaagga 
ccctgagagg 
ctgggagtgt 
aggctagtca 
ggaaggcgac 
agtgtgaggt 
ggtttatgat 
ctgaggcccg 
ccattttgca 
gcagatctgt 
ccggagaaga 
aaaaactcaa 
ctgggtgttc 
aacacgcagc 
aggaaaaggt 
ggattaaaat 



gtttaagagg 
atgaactgtg 
tctataattt 
tatggatgat 
taaggagttc 
aggagacacg 
ccgcagtgga 
gcgccgaggg 
cacagtcact 
aagtggccct 
aggaggtgaa 
acttcagaac 
cagtcagaat 
ccaaataggc 
gggaccagtg 
tcagggaaat 
ttacgacccc 
ctgctgtgat 
agggcggctc 
agtgcaggat 
gggtgctgat 
ccagggcata 
gatattccag 
cagtgtagca 
agctaccatg 
caagacgaag 
ctcttctgtg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 



cacaagagac tagcgtcaga gaaaagggaa aacccagtga agaaagtgat gctggcttcc 1680 
aggag?gaga ct?c?gggaa ggaggcagcc tgtgagagca gcacaccatc ctgggcaagt 
qaccacaact acaatgctgt gaagccagag aagccagaga agcccactgc actctcgccc 18UU 
accSattga gtaaatgtac gtatcaccca aaggctggct tcccaggccc ctcccatcat 1860 
ctgggJggct gcctggggct gtctaggacc agagtcctgg gtgttctggt gctgatagta 1920 
gccagcagct cactgccagc cagaagcaga taccaagatg cctctggacc ccaggtgttc 1980 
c^gcctagcc tgtggagcct ctctgggtgg ttcctaaaga gctgtgtagg cctcatgttg 2040 
qaqgcaa?tt cttatttcag ttttaggcct tggtaagtat ttgaactgct ctcaacaaga 2100 
?gggacaaca tcagtgcac? gttcagactc agttcagact tgagttcctc acaggacagc 210 
ccagcacaaa ccatggagtt ttcccacgtt acctattttt cttcaacatg gtcaccactg 2220 
etc tcta" gaaaagtgat ttttttcatg agttagggca tttttgaaag gcaggtaga 22 0 
caatttgtgt ctatttcaca tactacacac ctacataaac aggtttgaat tttgaaggtc 2340 
a?ctgc?gct cagatcccat ggtgagtaat cgtgctgacc atacacctac agcagtattc 2 00 
tcatcaaagt gagaattgtg ggtggtagct tccagactcg tagatgcccc agatttcaaa 2460 
tcaqtqttta tagtttgagt aaacttttga aacccaagat tagatcactg agtttggagc 2b^u 
caScSgtt teactgetgt acaaggtaca cctttcttac tttttgtaca gtggtttcaa 2580 
gtgaattttc atctgtgttc tcatgatagt gttttgttcc atctctgtcc accctcagcc 2640 
ccagataagg gggtctcatg gectaagctg gecttaaatt ttctgtaaag ttgggggtcc 2700 
tXccccc acctgtaaag agtgctgaaa cttacaggca cacacctgat ? tatgctgtg 2760 
gggtgtgaac tcagagtgtt ggaccagcag tctaccagct gagctgeagt tetagecatg 2820 
gtaactgaaa ctgetgatet tcttgtcttt gtttcctttg tgctggg 2867 



<210> 3 
<211> 562 
<212> PRT 

<213> Homo sapiens 
<400> 3 



Met Asp Asp Lys Gly Asp Pro Ser Asn Glu Glu Ala Pro Lys Ala lie 
1 5 10 

Lys Pro Thr Ser Lys Glu Phe Arg Lys Thr Tr P Gly Phe Arg Arg Thr 



20 



Thr lie Ala Lys Arg Glu Gly Ala Gly Asp Ala Glu Ala Asp Pro Leu 



35 



40 



45 



Glu Pro Pro Pro Pro Gin Gin Gin Leu Gly Leu Ser Leu Arg Arg Ser 



50 



55 



60 



Gly Arg Gin Pro Lys Arg Thr Glu Arg Val Glu Gin Phe Leu Thr lie 
65 70 75 

Ala Arg Arg Arg Gly Arg Arg Ser Met Pro Val Ser Leu Glu Asp Ser 
85 90 

Gly Glu Pro Thr Ser Cys Pro Ala Thr Asp Ala Glu Thr Ala Ser Glu 
100 105 110 

Glv Ser Val Glu Ser Ala Ser Glu Thr Arg Ser Gly Pro Gin Ser Ala 
115 120 125 

Ser Thr Ala Val Lys Glu Arg Pro Ala Ser Ser Glu Lys Val Lys Gly 
130 135 140 

Gly Asp Asp His Asp Asp Thr Ser Asp Ser Asp Ser Asp Gly Leu Thr 



145 



150 



155 



160 



Leu Lys Glu Leu Gin Asn Arg Leu Arg Arg Lys Arg Glu Gin Glu Pro 
165 170 

Thr Glu Arg Pro Leu Lys Gly He Gin Ser Arg Leu Arg Lys Lys Arg 

180 185 
Arg Glu Glu Gly Pro Ala Glu Thr Val Gly Ser Glu Ala Ser Asp Thr 



195 



200 



Val Glu Gly Val Leu Pro Ser Lys Gin Glu Pro Glu Asn Asp Gin Gly 

215 220 



210 



Val Val Ser Gin Ala Gly Lys Asp Asp Arg Glu Ser Lys Leu Glu Gly 



235 



225 230 

Ala Ala Gin Asp He Lys Asp Glu Glu Pro Gly Asp Leu Gly Arg 



Lys 



245 



250 



Pro Lys Pro Glu Cys Glu Gly Tyr Asp Pro Asn Ala Leu Tyr Cys lie 
260 265 210 

Cys Arg Gin Pro His Asn Asn Arg Phe Met lie Cys Cys Asp Arg Cys 
275 280 285 

Glu Glu Tr P Phe His Gly Asp Cys Val Gly He Ser Glu Ala Arg Gly 



290 



295 



300 



Arg Leu Leu Glu Arg Asn Gly Glu Asp Tyr lie Cys Pro Asn Cys Thr 
305 310 315 

lie Leu Gin Val Gin Asp Glu Thr His Ser Glu Thr Ala Asp Gin Gin 



325 



Glu Ala Lys Trp Arg Pro Gly Asp Ala Asp Gly Thr Asp Cys Thr Ser 



340 



345 



lie Gly Thr lie Glu Gin Lys Ser Ser Glu Asp Gin Gly He Lys Gly 



355 



360 



Arg lie Glu Lys Ala Ala Asn Pro Ser Gly Lys Lys Lys Leu Lys lie 

375 380 



370 



Phe Gl 
385 



n Pro Val lie Glu Ala Pro Gly Ala Ser Lys Cys He Gly Pro 
390 395 400 



Gly Cys Cys His Val Ala Gin Pro Asp Ser Val Tyr Cys Ser Asn Asp 
Cys lie Leu Lys His Ala Ala Ala Thr Met Lys Phe Leu Ser Ser Gly 



420 



425 



Lys Glu Gin Lys Pro Lys Pro Lys Glu Lys Met Lys Met Lys Pro Glu 

* - ^ 4 4b 



435 



440 



Lys Pro Ser Leu Pro Lys Cys Gly Ala Gin Ala Gly He Lys He Ser 



450 

Ser Val His Lys Arg 
465 

Lys Ala Val Val Val 
485 



Ala Cys Glu Ser Ser 
500 

Ala Val Lys Pro Glu 
515 

Lys Cys Met Tyr His 
530 

Phe Trp He Ala He 
545 

Leu Cys 



455 

Pro Ala Pro Glu Lys 
470 

Pro Ala Arg Ser Glu 
490 

Thr Pro Ser Trp Ala 
505 

Lys Thr Ala Ala Pro 
520 

Leu Gly Val Gly Leu 
535 

Pro Trp Ala Cys Pro 
550 



460 

Lys Glu Thr Thr Val Lys 
475 480 

Ala Leu Gly Lys Glu Ala 
495 

Ser Asp His Asn Tyr Asn 
510 

Ser Pro Ser Leu Leu Tyr 
525 

Leu Asp Pro Ser Arg Ser 
540 

Gly Leu Gly Val Ala Ala 
555 560 



<210> 4 
<211> 614 
<212> PRT 
<213> Murine 



<400> 4 



Met Asp Asp Lys Gly His Leu Ser Asn Glu Glu Ala Pro Lys Ala lie 
1 5 10 15 



Lys 



Pro Thr Ser Lys Glu Phe Arg Lys Thr Trp Gly Phe Arg Arg Thr 



20 



25 



30 



Thr lie Ala Lys Arg Glu Gly Ala Gly Asp Thr Glu Ala Asp Pro Ser 
35 40 45 

Glu Gin Gin Pro Gin Gin His Asn Leu Ser Leu Arg Arg Ser Gly Arg 
50 55 60 

Gin Pro Lys Arg Thr Glu Arg Val Glu Glu Phe Leu Thr Thr Val Arg 
65 70 75 80 

Arq Arg Gly Lys Lys Asn Val Pro Val Ser Leu Glu Asp Ser Ser Glu 
85 90 95 

Pro Thr Ser Ser Thr Val Thr Asp Val Glu Thr Ala Ser Glu Gly Ser 
100 105 HO 

Val Glu Ser Ser Ser Glu lie Arg Ser Gly Pro Val Ser Asp Ser Leu 
115 120 125 

Glv Lys Glu His Pro Ala Ser Ser Glu Lys Ala Lys Gly Gly Glu Glu 
130 135 140 



t 



Glu Glu Asp Thr Ser Asp Ser Asp Ser Asp Gly Leu Thr Leu Lys Glu 
145 150 155 160 

Leu Gin Asn Arg Leu Arg Arg Lys Arg Glu Gin Glu Pro Val Glu Arg 
165 170 175 

Ser Leu Arg Gly Ser Gin Asn Arg Leu Arg Lys Lys Arg Arg Glu Glu 
180 185 190 

Asp Ser Ala Glu Thr Gly Ser Val Gin He Gly Ser Ala Glu Gin Asp 
195 200 205 

Arq Pro Leu Cys Lys Gin Glu Pro Glu Ala Ser Gin Gly Pro Val Ser 
210 215 220 

Gin Ser Glu Thr Asp Asp He Glu Asn Gin Leu Glu Gly Lys Ala Thr 

225 230 235 240 



Gl 



n Gly Asn Thr Glu Glu Asn Pro Arg Glu Ala Gly Lys Pro Lys Pro 

255 



245 



250 



Glu Cys Glu Val Tyr Asp Pro Asn Ala Leu Tyr Cys He Cys Arg Gin 
ocn 265 270 



260 



Pro His Asn Asn Arg Phe Met He Cys Cys Asp Arg Cys Glu Glu Trp 
275 280 285 

Phe His Gly Asp Cys Val Gly He Ser Glu Ala Arg Gly Arg Leu Leu 
290 295 300 

Glu Arg Asn Gly Glu Asp Tyr He Cys Pro Asn Cys Thr He Leu Gin 
305 310 315 320 

Val Gin Asp Glu Thr Asn Gly Ser Ala Thr Asn Glu Gin Asp Ser Gly 
325 330 335 

Cys Arg Ser Val Gly Ala Asp Gly Thr Asp Cys Thr Ser He Gly Thr 
340 345 350 

Val Glu Gin Lys Ser Gly Glu Asp Gin Gly He Lys Gly Arg He Glu 
355 360 365 

Lys Ala Ala Asn Pro Ser Gly Lys Lys Lys Leu Lys He Phe Gin Pro 
370 375 380 

Val Val Glu Ala Pro Gly Ala Pro Lys Cys He Gly Pro Gly Cys Ser 
385 390 395 400 

Ser Val Ala Gin Pro Asp Ser Val Tyr Cys Ser Asn Asp Cys He Leu 
405 410 415 

Lys His Ala Ala Ala Thr Met Arg Phe Leu Ser Ser Gly Lys Glu Gin 
420 425 430 

Lys Thr Lys Pro Lys Glu Lys Val Lys Thr Lys Pro Glu Lys Phe Ser 
435 440 445 



Leu Pro Lys Cys 
450 

Lys Arg Leu Ala 
465 

Leu Ala Ser Arg 



Ser Thr Pro Ser 
500 

Glu Lys Pro Glu 
515 

Cys Thr Tyr His 
530 

Gly Gly Cys Leu 
545 

Leu He Val Ala 



Ala Ser Gly Pro 
580 

Trp Phe Leu Lys 
595 

Phe Ser Phe Arg 
610 



Ser Val Gin Val 
455 

Ser Glu Lys Arg 
470 

Ser Glu Thr Ser 
485 

Trp Ala Ser Asp 



Lys Pro Thr Ala 
520 

Pro Lys Ala Gly 
535 

Gly Leu Ser Arg 
550 

Ser Ser Ser Leu 
565 

Gin Val Phe Leu 



Ser Cys Val Gly 
600 

Pro Trp 



Gly He Lys He 
460 

Glu Asn Pro Val 
475 

Gly Lys Glu Ala 
490 

His Asn Tyr Asn 
505 

Leu Ser Pro Thr 



Phe Pro Gly Pro 
540 

Thr Arg Val Leu 
555 

Pro Ala Arg Ser 
570 

Pro Ser Leu Trp 
585 

Leu Met Leu Glu 



Ser Ser Val His 



Lys Lys Val Met 
480 

Ala Cys Glu Ser 
495 

Ala Val Lys Pro 
510 

Leu Leu Ser Lys 
525 

Ser His His Leu 



Gly Val Leu Val 
560 

Arg Tyr Gin Asp 
575 

Ser Leu Ser Gly 
590 

Ala He Ser Tyr 
605 



<210> 5 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 5 

aggtgtacct tgtacagcag tgaaac 



<210> 6 

<211> 16 

<212> PRT 

<213> Artificial 

<220> 

<223> Additional N terminal cysteine for addition to a pepti 
corresponding to amino acids 58-72 of murine DIO-1 



<400> 6 



Cys Ser Leu Arg Arg Ser Gly Arg Gin Pro Lys Arg Thr Glu Arg Val 
15 10 15 



1'. 

I 



Alignment 1 

i 50 

KIAA0333 

SEQ_ID_No3 MDDKGDPSNE EAPKAIKPTS KEFRKTWGFR RTTIAKREGA GDAEADPLEP 
Consensus 

51 100 

KIAA0333 

SEQ_ID_No3 PPPQQQLGLS LRRSGRQPKR TERVEQFLTI ARRRGRRSMP VSLEDSGEPT 
Consensus 

101 150 

KIAA0333 

SEQ_ID_No3 SCPATDAETA SEGSVESASE TRSGPQSAST AVKER PASSE KVKGGDDHDD 
Consensus 

151 200 

KIAA0333 

SEQ_ID_No3 TSDSDSDGLT LKELQNRLRR KREQEPTERP LKGIQSRLRK KRREEGPAET 
Consensus 

201 250 
KIAA0333 ESKLEG KAAQDIKDEE 

SEQ ID No3 VGSEASDTVE GVLPSKQEPE NDQGWSQAG KDDRESKLEG KAAQDIKDEE 
Consensus ESKLEG KAAQDIKDEE 

251 300 
KIAA0333 PGDLGRPKPE CEGYDPNALY CICRQPHNNR FMICCDRCEE WFHGDCVGIS 
SEQ ID No3 PGDLGRPKPE CEGYDPNALY CICRQPHNNR FMICCDRCEE WFHGDCVGIS 
Consensus PGDLGRPKPE CEGYDPNALY CICRQPHNNR FMICCDRCEE WFHGDCVGIS 

301 350 

KIAA0333 EARGRLLERN GEDYICPNCT ILQVQDETHS ETADQQEAKW RPGDADGTDC 

SEQ ID_No3 EARGRLLERN GEDYICPNCT ILQVQDETHS ETADQQEAKW RPGDADGTDC 

Consensus EARGRLLERN GEDYICPNCT ILQVQDETHS ETADQQEAKW RPGDADGTDC 

351 400 
KIAA0333 TSIGTIEQKS SEDQGIKGRI EKAANPSGKK KLKIFQPGPG PVPTQLPVLW 

SEQ_ID_No3 TSIGTIEQKS SEDQGIKGRI EKAANPSGKK KLKIFQP 

Consensus TSIGTIEQKS SEDQGIKGRI EKAANPSGKK KLKIFQP 

401 450 

KIAA0333 QVLEIAVSRS ISAFTLLHCI SCKVIEAPGA SKCIGPGCCH VAQPDSVYCS 

SEQ ID No3 VIEAPGA SKCIGPGCCH VAQPDSVYCS 

Consensus VIEAPGA SKCIGPGCCH VAQPDSVYCS 

451 500 

KIAA0333 NDCILKHAAA TMKFLSSGKE QKPKPKEKMK MKPEKPSLPK CGAQAGIKIS 

SEQ ID No3 NDCILKHAAA TMKFLSSGKE QKPKPKEKMK MKPEKPSLPK CGAQAGIKIS 

Consensus NDCILKHAAA TMKFLSSGKE QKPKPKEKMK MKPEKPSLPK CGAQAGIKIS 

501 550 
KIAA0333 SVHKRPAPEK KETTVKKAW VPARSEALGK EAACESSTPS WASDHNYNAV 
SEQ ID No3 SVHKRPAPEK KETTVKKAW VPARSEALGK EAACESSTPS WASDHNYNAV 
Consensus SVHKRPAPEK KETTVKKAW VPARSEALGK EAACESSTPS WASDHNYNAV 

551 600 

KIAA0333 KPEKTAAPSP SLLYKSTKE- DRRSEEKAAA TAASKKTAPP GSTVGKQPAP 

SEQ_ID_No3 KPEKTAAPSP SLLYKCMYHL GVGLLDPSRS FWIAIPWACP GLGVAALC 

Consensus KPEKTAAPSP SLLYKcmke . drrle#kaaa faaaiktAcP GlgVaaqc. 
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Alignment 1 



KIAA0333 
SEQ_ID_No3 
Consensus 



KIAA0333 
SEQ_ID_No3 
Consensus 



KIAA0333 
SEQ_ID_No3 
Consensus 



KIAA0333 
SEQ_ID_No3 
Consensus 



KIAA0333 
SEQ_ID_No3 
Consensus 



KIAA0333 
SEQ_ID_No3 
Consensus 



KIAA0333 
SEQ_ID_No3 
Consensus 



601 650 
RNLVPKKSSF ANVAAATPAI KKPPSGFKGT IPKRPWLSAT PSSGASAARQ 



651 700 
AGPAPAAATA ASKKFPGSAA LVGAVRKPW PSVPMASPAP GRLGAMSAAP 



701 750 
SQPNSQIRQN IRRSLKEILW KRVNDSDDLI MTENEVGKIA LHIEKEMFNL 



751 800 
FQVTDNRYKS KYRSIMFNLK DPKNQGLFHR VLREEISLAK LVRLKPEELV 



801 8 50 
SKELSTWKER PARSVMESRT KLHNESKKTA PRQEAI PDLE DSPPVSDSEE 



851 900 
QQESARAVPE KSTAPLLDVF SSMLKDTTSQ HRAHLFDLNC KICTGQVPSA 



901 950 
EDEPAPKKQK LSASVKKEDL KSKHDSSAPD PAPDSADEVM PEAVPEVASE 



951 1000 
PGLESASHPN VDRTYFPGPP GDGHPEPSPL EDLSPCPASC GSGWTTVTV 



KIAA0333 
SEQ_ID_No3 

Consensus 

1001 1050 
KIAA0333 SGRDPRTAPS SSCTAVASAA SRPDSTHMVE ARQDVPKPVL TSVMVPKSIL 
SEQ_ID_No3 

Consensus 

1051 1100 
KIAA0333 AKPSSSPDPR YLSVPPSPNI STSESRSPPE GDTTLFLSRL STIWKGFINM 
SEQ_ID_No3 

Consensus 

1101 H50 
KIAA0333 QSVAKFVTKA YPVSGCFDYL SEDLPDTIHI GGRIAPKTVW DYVGKLKSSV 
SEQ_ID_No3 

Consensus 

1151 I 200 
KIAA0333 SKELCLIRFH PATEEEEVAY ISLYSYFSSR GRFGWANNN RHVKDLYLIP 

SEQ_ID_No3 

Consensus • 



1201 



1226 
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KIAA0333 LSAQDPVPSK LLPFEGPGKR RLSGWR 
SEQ_ID_No3 
Consensus 
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i 50 

KIAA0333 

SEQ_ID_No4 MDDKGHLSNE EAPKAIKPTS KEFRKTWGFR RTTIAKREGA GDTEADPSEQ 
Consensus 

51 100 

KIAA0333 

SEQ_ID_No4 QPQQHNLSLR RSGRQPKRTE RVEEFLTTVR RRGKKNVPVS LEDSSEPTSS 
Consensus 

101 150 

KIAA0333 

SEQ_ID_No4 TVTDVETASE GSVESSSEIR SGPVSDSLGK EHPASSEKAK GGEEEEDTSD 
Consensus 

151 200 

KIAA0333 

SEQ_ID_No4 SDSDGLTLKE LQNRLRRKRE QEPVERSLRG SQNRLRKKRR EEDSAETGSV 
Consensus 

201 250 
KIAA0333 ESKLEGKAA QDIKDEEPGD 

SEQ ID No4 QIGSAEQDRP LCKQEPEASQ GPVSQSETDD IENQLEGKAT QGNTEENPRE 
Consensus EnqLEGKAa Qdnk#E#Pr# 

251 300 

KIAA0333 LGRPKPECEG YDPNALYCIC RQPHNNRFMI CCDRCEEWFH GDCVGISEAR 

SEQ ID No4 AGKPKPECEV YDPNALYCIC RQPHNNRFMI CCDRCEEWFH GDCVGISEAR 

Consensus aGrPKPECEg YDPNALYCIC RQPHNNRFMI CCDRCEEWFH GDCVGISEAR 

301 350 
KIAA0333 GRLLERNGED YICPNCTILQ VQDETHSETA DQQEAKWRPG DADGTDCTSI 
SEQ ID No4 GRLLERNGED YICPNCTILQ VQDETNGSAT NEQDSGCRSV GADGTDCTS I 
Consensus GRLLERNGED YICPNCTILQ VQDETngeaa ##Q#agcRpg dADGTDCTSI 

351 400 

KIAA0333 GTIEQKSSED QGIKGRIEKA ANPSGKKKLK IFQPGPGPVP TQLPVLWQVL 

SEQ_ID_No4 GTVEQKSGED QGIKGRIEKA ANPSGKKKLK IFQP 

Consensus GT ! EQKSgED QGIKGRIEKA ANPSGKKKLK IFQP 

401 450 

KIAA0333 EIAVSRSISA FTLLHCISCK VIEAPGASKC IGPGCCHVAQ PDSVYCSNDC 

SEQ ID No4 WEAPGAPKC IGPGCSSVAQ PDSVYCSNDC 

Consensus VIEAPGApKC IGPGCchVAQ PDSVYCSNDC 

451 500 
KIAA0333 ILKHAAATMK FLSSGKEQKP KPKEKMKMKP EKPSLPKCGA QAGIKISSVH 
SEQ ID No4 ILKHAAATMR FLSSGKEQKT KPKEKVKTKP EKFSLPKCSV QVGIKISSVH 
Consensus ILKHAAATMr FLSSGKEQKp KPKEKmKmKP EKf SLPKCga QaGIKISSVH 

501 550 

KIAA0333 KRPAPEKKET TVKKAWVPA RSEALGKEAA CESSTPSWAS DHNYNAVKPE 

SEQ ID No4 KRLASEKREN PVKK-VMLAS RSETSGKEAA CESSTPSWAS DHNYNAVKPE 

Consensus KRlApEKrEn pVKK.Vmlaa RSEalGKEAA CESSTPSWAS DHNYNAVKPE 

551 600 

KIAA0333 KTAAP SP SLLYKSTKED RRSEEKAAAT AASKKTAPPG STVGKQP- -A 

SEQ_ID_No4 KPEKPTALSP TLLSKCTYHP KAGFPGPSHH LGGCLGLSRT RVLGVLVLIV 
Consensus KpaaP...SP sLLsKcTked rageegaaah aagclgaprg rtlGkqp. .a 
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650 

601 

KIAA0333 PRNLVPKKSS FANVAAATPA IKKPPSGFKG TIPKRPWLSA TPSSGASAAR 
SEQ ID No4 ASSSLPARSR YQDASGPQVF LPSLWSLSGW FLKSCVGLML EAISYFSFRP 
Consensus arnllParSr %a#aaaaqpa ikklpSgfgg fikkrpgLma eaiSgaSaar 

651 700 
KIAA0333 QAGPAPAAAT AASKKFPGSA ALVGAVRKPV VPSVPMASPA PGRLGAMSAA 

SEQ_ID_No4 W 

Consensus q 

701 750 
KIAA0333 PSQPNSQIRQ NIRRSLKEIL WKRVNDSDDL IMTENEVGKI ALHIEKEMFN 

SEQ_ID_No4 

Consensus 

751 800 
KIAA0333 LFQVTDNRYK SKYRSIMFNL KDPKNQGLFH RVLREEISLA KLVRLKPEEL 

SEQ_ID_No4 

Consensus 

801 850 
KIAA0333 VSKELSTWKE RPARSVMESR TKLHNESKKT APRQEAIPDL EDSPPVSDSE 

SEQ_ID_No4 

Consensus 

851 900 
KIAA0333 EQQESARAVP EKSTAPLLDV FSSMLKDTTS QHRAHLFDLN CKICTGQVPS 

SEQ_ID_No4 

Consensus 

901 950 
KIAA0333 AEDEPAPKKQ KLSASVKKED LKSKHDSSAP DPAPDSADEV MPEAVPEVAS 

SEQ_ID_No4 

Consensus 

951 1000 
KIAA0333 EPGLESASHP NVDRTYFPGP PGDGHPEPSP LEDLSPCPAS CGSGWTTVT 

SEQ_ID_No4 

Consensus 

1001 1050 
KIAA0333 VSGRDPRTAP SSSCTAVASA ASRPDSTHMV EARQDVPKPV LTSVMVPKSI 

SEQ_ID_No4 

Consensus 

1051 1100 
KIAA0333 LAKPSSSPDP RYLSVPPSPN ISTSESRSPP EGDTTLFLSR LSTIWKGFIN 

SEQ_ID_No4 

Consensus 

1101 1150 

KIAA0333 MQSVAKFVTK AYPVSGCFDY LSEDLPDTIH IGGRIAPKTV WDYVGKLKSS 
SEQ_ID_No4 

Consensus 

1200 

1151 

KIAA0333 VSKELCLIRF HPATEEEEVA YISLYSYFSS RGRFGWANN NRHVKDLYLI 
SEQ_ID_No4 

Consensus 
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1201 1227 
KIAA0333 PLSAQDPVPS KLLPFEGPGK RRLSGWR 
SEQ_ID_No4 

Consensus 
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